Predicting Depression From Smartphone Behavioral Markers Using Machine Learning Methods, Hyperparameter Optimization, and Feature Importance Analysis: Exploratory Study

نویسندگان

چکیده

Background Depression is a prevalent mental health challenge. Current depression assessment methods using self-reported and clinician-administered questionnaires have limitations. Instrumenting smartphones to passively continuously collect moment-by-moment data sets quantify human behaviors has the potential augment current for early diagnosis, scalable, longitudinal monitoring of depression. Objective The objective this study was investigate feasibility predicting with quantified from smartphone sets, identify that can influence Methods Smartphone 8-item Patient Health Questionnaire (PHQ-8) assessments were collected 629 participants in an exploratory over average 22.1 days (SD 17.90; range 8-86). We 22 regularity, entropy, SD behavioral markers data. explored relationship between features correlation bivariate linear mixed models (LMMs). leveraged 5 supervised machine learning (ML) algorithms hyperparameter optimization, nested cross-validation, imbalanced handling predict Finally, permutation importance method, we identified influential Results Of at least 56 countries, 69 (10.97%) females, 546 (86.8%) males, 14 (2.2%) nonbinary. Participants’ age distribution as follows: 73/629 (11.6%) aged 18 24, 204/629 (32.4%) 25 34, 156/629 (24.8%) 35 44, 166/629 (26.4%) 45 64, 30/629 (4.8%) 65 years over. 1374 PHQ-8 assessments, 1143 (83.19%) responses nondepressed scores (PHQ-8 score <10), while 231 (16.81%) depressed ≥10), based on cut-off. A significant positive Pearson found screen status–normalized entropy (r=0.14, P<.001). LMM demonstrates intraclass 0.7584 association (β=.48, P=.03). best ML achieved following metrics: precision, 85.55%-92.51%; recall, 92.19%-95.56%; F1, 88.73%-94.00%; area under curve receiver operating characteristic, 94.69%-99.06%; Cohen κ, 86.61%-92.90%; accuracy, 96.44%-98.14%. Including group gender predictors improved performances. Screen internet connectivity most Conclusions Our findings demonstrate indicative be unobtrusively sensors’ Traditional augmented diagnosis monitoring.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Applying Model-Based Optimization to Hyperparameter Optimization in Machine Learning

This talk will cover the main components of sequential modelbased optimization algorithms. Algorithms of this kind represent the state-of-the-art for expensive black-box optimization problems and are getting increasingly popular for hyper-parameter optimization of machine learning algorithms, especially on larger data sets. The talk will cover the main components of sequential model-based optim...

متن کامل

Using Machine Learning for Exploratory Data Analysis

This tutorial will introduce attendees to fundamental concepts in the clustering and dimensionality reduction fields of unsupervised machine learning. Attendees will learn about the assumptions algorithms make and how those assumptions can cause the algorithms to be more or less suited to particular datasets. Hands-on interaction with machine learning algorithms on real and synthetic data are a...

متن کامل

Combining Feature and Algorithm Hyperparameter Selection using some Metalearning Methods

Machine learning users need methods that can help them identify algorithms or even workflows (combination of algorithms with preprocessing tasks, using or not hyperparameter configurations that are different from the defaults), that achieve the potentially best performance. Our study was oriented towards average ranking (AR), an algorithm selection method that exploits meta-data obtained on pri...

متن کامل

Hyperparameter Search in Machine Learning

Abstract We describe the hyperparameter search problem in the field of machine learning and discuss its main challenges from an optimization perspective. Machine learning methods attempt to build models that capture some element of interest based on given data. Most common learning algorithms feature a set of hyperparameters that must be determined before training commences. The choice of hyper...

متن کامل

Predicting the Subcellular Localization of Human Proteins Using Machine Learning and Exploratory Data Analysis

Identifying the subcellular localization of proteins is particularly helpful in the functional annotation of gene products. In this study, we use Machine Learning and Exploratory Data Analysis (EDA) techniques to examine and characterize amino acid sequences of human proteins localized in nine cellular compartments. A dataset of 3,749 protein sequences representing human proteins was extracted ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Jmir mhealth and uhealth

سال: 2021

ISSN: ['2291-5222']

DOI: https://doi.org/10.2196/26540